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CLAIMS 

What is claimed as new and desired to be protected by Letters Patent of the United 

States is: 



1 LA method for decreasing the latency between an instruction cache and a pipeline 

2 processor having a plurality of parallel execution stages, each execution stage having a decode 

3 stage and an instruction queue for sequentially processing instructions being processed by said 

4 processor, comprising: 

5 determining whether said decode stage and instruction queue do not have valid data; and 

6 inserting instructions from said instruction cache in parallel to said decode stage and 

7 instruction queue when said decode stage and instruction queue contain invalid data. 
1 

1 2. A method for decreasing the latency between an instruction cache and said 

2 pipeline processor according to claim 1 further comprising: 

3 processing said cache instructions from said cache sequentially through said decode stage 

4 and instruction queue when valid data exists in said instruction queue. 

1 

1 3. A method for processing instructions in a pipelined processor having a series of 

2 pipelined stages which reduces latency between an instruction queue and a pipeline processor 

3 comprising: 

4 serially fetching a plurality of instructions to be executed in said pipeline processor from 

5 a cache memory; 

6 decoding each of said fetched instruction addresses in a first stage of said pipeline 

7 processor to determine if an execution branch is to be taken; 

8 loading said instruction into said instruction queue at the same time said instruction is 

9 being loaded in said decoder when said instruction queue and decoder are empty; 



12 
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10 sequentially loading said instruction into said instruction queue from said decoder when 

1 1 said instruction queue and said decoder are not empty; and 

12 shifting the contents of an instruction queue to produce an instruction from said 

13 instruction queue for processing in subsequent pipeline stages. 
1 

1 4. The method for processing instructions in a pipelined processor according to 

2 claim 3 wherein said decoder identifies said instructions loaded in said queue at the same time as 

3 said instructions which are loaded in said decoder as valid or invalid during a subsequent cycle 

4 of said pipeline processor if an execution branch is not taken. 
1 

1 5. The method for processing instructions in a pipelined processor according to 

2 claim 3 wherein said instruction queue contents are shifted left to an output port connected to 

3 plural pipelined processor stages. 
1 

1 6. A method for executing instructions in a pipelined processor comprising: 

2 sequentially fetching the addresses of instructions to be executed by said pipelined 

3 processor; 

4 determining if said instructions are stored in a cache memory; 

5 determining whether a decode stage and location of an instruction queue stage of said 

6 pipe line processor is empty; 

7 loading said instruction from said cache memory in said decode stage and said instruction 

8 queue in parallel when said stages are empty; and 

9 sequentially reading out said instruction queue instructions for execution in said pipelined 
10 processor. 

1 

1 
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1 7. The method for executing instructions in a pipelined processor according to claim 

2 6 further comprising: 

3 loading only said decode stage with said instructions when said instruction queue 

4 contains valid data, and sequentially transferring said instructions to said instruction queue when 

5 a position in said instruction queue is available. 
1 

1 8. The method for executing instructions in a pipelined processor according to claim 

2 7 further comprising identifying said instruction as a branch instruction if said decoder predicts 

3 that a branch is being taken from said instruction; and 

4 inhibiting transfer of subsequent instructions from said decoder to said instruction queue. 
1 

1 9. The method for executing instructions in a pipeline processor according to claim 7 

2 further comprising: 

3 fetching an instruction from a main memory when said instruction is not in said cache 

4 memory; and 

5 forwarding said instruction to said decode stage for sequential transfer to said instruction 

6 queue. 
1 

1 10. The method for executing instructions in a pipeline processor according to claim 6 

2 further comprising: 

3 examining the contents of a location in said queue; and 

4 determining the state of a valid bit in each of said locations whereby the determination of 

5 whether said location contains valid data is made. 
1 

1 11. The method for executing instructions in a pipeline processor according to claim 

2 7, wherein said instructions are transferred from said decode stage to said instruction queue each 

3 time an instruction is read from said instruction queue. 
1 

1 
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1 



12. An apparatus for reducing the latency between stages of a pipelined processor 



2 



comprising: 



3 



an instruction cache producing a plurality of instructions for execution by said pipelined 



4 



processor; 



5 



a plurality of decode stages connected to receive a plurality of instructions from said 



6 cache; 



7 



an instruction queue having a plurality of locations for receiving an instruction and a 



8 valid bit; and 



9 



a plurality of multiplexers for receiving each of said instructions, an output of a 



10 respective decode stage receiving said instructions, and connected to receive a valid bit from a 

1 1 location of said queue as a select signal, said multiplexer connected to supply each of said 

12 instruction queue locations with one of said instructions selected from either from said decoder 

13 stage or from said cache. 
1 

1 13. The apparatus according to claim 12, wherein said multiplexer receives a shift 

2 signal for said instruction queue which shifts the contents of said instruction queue towards an 

3 output port of said queue, and which enables said instructions from said decoder to be transferred 

4 to said instruction queue. 
1 

1 14. The apparatus according to claim 12 wherein said output port of said queue is 

2 connected to a plurality of parallel processing stages. 
1 

1 15. The apparatus according to claim 14 wherein said processing stages execute 

2 instructions belonging to one of a load/store operation, arithmetic operation, or a branch target 

3 instruction. 
1 
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